首页> 外文OA文献 >Dimension Projection among Languages based on Pseudo-relevant Documents for Query Translation
【2h】

Dimension Projection among Languages based on Pseudo-relevant Documents for Query Translation

机译:基于伪相关文档的语言维度投影   用于查询翻译

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Using top-ranked documents in response to a query has been shown to be aneffective approach to improve the quality of query translation indictionary-based cross-language information retrieval. In this paper, wepropose a new method for dictionary-based query translation based on dimensionprojection of embedded vectors from the pseudo-relevant documents in the sourcelanguage to their equivalents in the target language. To this end, first welearn low-dimensional vectors of the words in the pseudo-relevant collectionsseparately and then aim to find a query-dependent transformation matrix betweenthe vectors of translation pairs appeared in the collections. At the next step,representation of each query term is projected to the target language and then,after using a softmax function, a query-dependent translation model is built.Finally, the model is used for query translation. Our experiments on four CLEFcollections in French, Spanish, German, and Italian demonstrate that theproposed method outperforms a word embedding baseline based on bilingualshuffling and a further number of competitive baselines. The proposed methodreaches up to 87% performance of machine translation (MT) in short queries andconsiderable improvements in verbose queries.
机译:使用顶级文档来响应查询已被证明是一种有效的方法,可以提高基于词典的跨语言信息检索查询翻译的质量。本文提出了一种新的基于字典的查询翻译方法,该方法基于从源语言中的伪相关文档到目标语言中等效文本的嵌入矢量的维投影。为此,首先分别学习伪相关集合中单词的低维向量,然后旨在找到出现在集合中的翻译对向量之间的查询相关转换矩阵。下一步,将每个查询词的表示形式投影到目标语言,然后在使用softmax函数之后,构建依赖于查询的翻译模型。最后,将该模型用于查询翻译。我们对法文,西班牙文,德文和意大利文的四个CLEF集合进行的实验表明,所提出的方法优于基于双语混排和更多竞争基准的词嵌入基准。所提出的方法在短查询中可达到高达87%的机器翻译(MT)性能,并且在详细查询中也有相当大的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号